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Abstract 

Consistent query answering is an inconsistency tolerant approach to obtaining semantically correct an- 
swers from a database that may be inconsistent with respect to its integrity constraints. In this work 
we formaHze the notion of consistent query answer for spatial databases and spatial semantic integrity 
^ \ constraints. In order to do this, we first characterize conflicting spatial data, and next, we define admissi- 

ble instances that restore consistency while staying close to the original instance. In this way we obtain 
a repair semantics, which is used as an instrumental concept to define and possibly derive consistent 
query answers. We then concentrate on a class of spatial denial constraints and spatial queries for which 
there exists an efficient strategy to compute consistent query answers. This study applies inconsistency 
■ tolerance in spatial databases, rising research issues that shift the goal from the consistency of a spatial 

\ database to the consistency of query answering. 
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O ■ 1 Introduction 

Consistency in database systems is defined as the satisfaction by a database instance of a set of integrity con- 
straints (ICs) that restricts the admissible database states. Although consistency is a desirable and usually 
^ I enforced property of databases, it is not uncommon to find inconsistent spatial databases due to data inte- 

^ ■ gration, unforced integrity constraints, legacy data, or time lag updates. In the presence of inconsistencies, 

- - -' there are alternative courses of action: (a) ignore inconsistencies, (b) restore consistency via updates on the 

database, or (c) accept inconsistencies, without changing the database, but compute the "consistent or cor- 
rect" answers to queries. For many reasons, the first two alternatives may not be appropriate [^, specially 
in the case of virtual data integration [i5J, where centralized and global changes to the data sources are not 
allowed. The latter alternative has been investigated in the relational case EKTOl. In this paper we explore 
this approach in the spatial domain, i.e., for spatial databases and with respect to spatial semantic integrity 
constraints (SICs). 

Extracting consistent data from inconsistent databases could be qualified as an "inconsistency tolerant" 
approach to querying databases. A piece of data will be part of a consistent answer if it is not logically 
related to the inconsistencies in the database with respect to its set of ICs. We introduce this idea using an 
informal and simple example. 



* Faculty Fellow of the IBM Center for Advanced Studies. Also affiliated to Universidad de Concepcion, Chile. 
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Example 1 Consider a database instance with a relation LandP, denoting land parcels, with a thematic 
attribute {idl), and a spatial attribute, geometry, of data type polygon. An IC stating that geometries of 
two different land parcels must be disjoint or just touch, i.e., they cannot internally intersect, is expected to 
be satisfied. However, the instance in Figure [T] does not satisfy this IC and therefore it is inconsistent: the 
land parcels with idls idl2 and idl^, overlap. Notice that these geometries are partially in conflict and what 
is not in conflict can be considered as consistent data. 
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Figure 1: An inconsistent spatial database 



Suppose that a query requests all land parcels whose geometries intersect with a query window, which 
represents the spatial region shown in Figured] as a rectangle with dashed borders. Although the database 
instance is inconsistent, we can still obtain useful and meaningful answers. In this case, only the intersection 
of g2 and gs is in conflict, but the rest of both geometries can be considered consistent and should be part 
of any "database repair" if we decide to restore consistency by means of minimal geometric changes. Thus, 
since the non-conflicting parts of geometries g2 and gs intersect the query window, we would expect an 
answer including land parcels with identities idl^ , idl2 and idls. ^ 

If we just concentrate on (in)consistency issues in databases (leaving aside consistent query answering for 
a moment), we can see that, in contrast to (in)consistency handling in relational databases, that has been 
largely investigated, not much research of this kind has been done for spatial databases. In particular, there 
is not much work around the formalization of semantic spatial ICs, satisfaction of ICs, and checking and 
maintenance of ICs in the spatial domain. However, some papers address the specification of some kinds of 
integrity constraints [|8l|20l, and checking topological consistency at multiple representations and for data 
integration [[T3l[T4l[3ll. 

More recently, [fT2l proposes qualitative reasoning with description logic to describe consistency be- 
tween geographic data sets. In [l22l a set of abstract relations between entity classes is defined; and they 
could be used to discover redundancies and conflicts in sets of SICs. A proposal for fixing (changing) 
spatial database instances under different types of spatial inconsistencies is given in [|29l . According to it, 
changes are applied over geometries in isolation; that is, they are not analyzed in combination with multiple 
SICs. In |[27]| some issues around query answering under violations of functional dependencies involving 
geometric attributes were raised. However, the problem of dealing with an inconsistent spatial database, 
while still obtaining meaningful answers, has not been systematically studied so far. 

Consistent query answering (CQA) from inconsistent databases as a strategy of inconsistent tolerance 
has an extensive literature (cf. H |6l [TOl for surveys). It was introduced and studied in the context of 
relational database in [2]. They defined consistent answers to queries as those that are invariant under all the 
minimal forms of restoring consistency of the original database. Thus, the notion of repair of an instance 
with respect to a set of ICs becomes a fundamental concept for defining consistent query answers. A repair 
semantics defines the admissible and consistent alternative instances to an inconsistent database at hand. 
More precisely, a repair of an inconsistent relational instance D is a consistent instance D' obtained from 
D by deleting or inserting whole tuples. The set of tuples by which D and D' differ is minimal under set 
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inclusion O. Other types of repair semantics have been studied in the relational case. For example, in 
|[T6l[32]l repairs are obtained by allowing updates of attribute values in tuples. 

In comparison to the relational case, spatial databases offer new alternatives and challenges when defin- 
ing a repair semantics. This is due, in particular, to the use of complex attributes to represent geometries, 
their combination with thematic attributes, and the nature of spatial (topological) relations. 

In this work we define a repair semantics for spatial databases with respect to a subset of spatial semantic 
integrity constraints (a.k.a. topo-semantic integrity constraints) l|29l, which impose semantic restrictions on 
topological predicates and combinations thereof. In particular, we treat spatial semantic integrity constraints 
that can be expressed by denials constraints. For example, they can specify that "two land parcels cannot 
internally intersect". This class of constraints are neither standardized nor integrated into current spatial 
database management systems (DBMSs); they rather depend on the application, and must be defined and 
handled by the database developers. They are very important because they capture the semantics of the 
intended models. Spatial semantic integrity constraints will be simply called spatial integrity constraints 
(SICs). Other spatial integrity constraints fTTl are domain (topological or geometric) constraints, and they 
refer to the geometry, topology, and spatial relations of the spatial data types. One of them could specify that 
"polygons must be closed". Many of these geometric constraints are now commonly integrated into spatial 
DBMSs 121. 

A definition of a repair semantics for spatial DBs and CQA for spatial range queries was first proposed 
in ll28]| . where we discussed the idea of shrinking geometries to solve conflicting tuples and applied to 
CQA for range queries. In this paper we complement and extend our previous work with the following 
main contributions: (1) We formalize the repair semantics of a spatial database instance under violations 
of SICs. This is done through virtual changes of geometries that participate in violations of SICs. Unlike 
ll28l . we identify the admissible local transformations and we use them to provide an inductive definition 
of database repair. (2) Based on this formalization, a consistent answer to a spatial query is defined as an 
answer obtained from all the admissible repairs. Extending the results in ll28l . we now define CQA not only 
for range but also for spatial join queries. (3) Although the repair semantics and consistent query answers 
can be defined for a fairly broad class of SICs and queries, as it becomes clear soon, naive algorithms for 
computing consistent answers on the basis of the computation of all repairs are of exponential time. For 
this reason, CQA for a relevant subset of SICs and range and join queries is done via a core computation. 
This amounts to querying directly the intersection of all repairs of an inconsistent database instance, but 
without actually computing the repairs. We show cases where this core can be specified as a view of the 
original, inconsistent database. (4) We present an experimental evaluation with real and synthetic data sets 
that compares the cost of CQA with the cost of evaluating queries directly over the inconsistent database 
(i.e., ignoring inconsistencies). 

The rest of the paper is organized as follows. In Section |2] we describe the spatial data model upon 
which we define the repair semantics and consistent query answers. A formal definition of repair for spatial 
inconsistent databases under SICs is introduced in Section [51 In Section |4] we define consistent answers 
to conjunctive queries. We analyze in particular the cases of range and join queries with respect to their 
computational properties. This leads us, in Section |5l to propose polynomial time algorithms (in data com- 
plexity) for consistent query answering with respect to a relevant class of SICs and queries. An experimental 
evaluation of the cost of CQA is provided in Section |6l Final conclusions and future research directions are 
given in Section |7l 
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2 Preliminaries 



Current models of spatial database are typically seen as extensions of the relational data model (known as 
extended-relational or object-relational models) with the definition of abstract data types to specify spatial 
attributes. We now introduce a general spatio-relational database model that includes spatio-relational pred- 
icates (they could also be purely relational) and spatial ICs. It uses some of the definitions introduced in 
|[25]| . The model is independent of the geometric data model (e.g. Spaghetti [301, topological |[l8l[30l, raster 
HH, or polynomial model ll24]| ) underlying the representation of spatial data types. 

A spatio-relational database schema is of the form H = {U.A^TZ^T^O^B), where: (a) U is the possibly 
infinite database domain of atomic thematic values, (b) ^ is a set of thematic, non-spatial, attributes, (c) 
7^ is a finite set of spatio-relational predicates whose attributes belong to A or are spatial attributes. Spatial 
attributes take admissible values in V(MJ^), the power set of R^, for an m that depends on the dimension 
of the spatial attribute, (d) T is a fixed set of binary spatial predicates, with a built-in interpretation, (e) 
(9 is a fixed set of geometric operators that take spatial arguments, also with a built-in interpretation, (f) B 
is a fixed set of built-in relational predicates, like comparison predicates, e.g. <,>,=, 7^, which apply to 
thematic attribute values. 

Each database predicate R e 71 has a type r{R) = [n, m], with n, m E N, indicating the number n of 
thematic attributes, and the spatial dimension m of the single spatial attribute (it takes values in P(M^))Q In 
Example[Il r(LandP) = [1, 2], since it has one thematic attribute (idl) and one spatial attribute (geometry) 
defined by a 2D polygon. In this work we assume that each relation R has a key of the form ([T) formed by 
thematic attributes only: 

\/xiX2XsSiS2 X2; Si) A X3; 52) {X2 = X3 A 5i = 52)), (1) 

where the Xi are sequences of distinct variables representing thematic attributes of R, and the Si are variables 
for geometric attributes. Here si = 52 means geometric equality; that is, the identity of two geometries. 

A database instance D of a spatio-relational schema H is a finite collection of ground atoms (or spatial 
database tuples) of the form i?(ci, c^; 5), where i? E 7^, (ci, c^) E ZY^ contains the thematic attribute 
values, and s e Ad C V{W^), where Ad is the class of admissible geometries (cf. below). The extension 
in a particular instance of a spatio-relational predicate is a subset ofU^ x Ad. For simplicity, and to fix 
ideas, we will consider the case where m = 2. 

Among the different abstraction mechanisms for modelling single spatial objects, we concentrate on 
regions for modelling real objects that have an extent. They are useful in a broad class of applications in 
Geographic Information Systems (GISs). More specifically, our model will be compatible with the specifi- 
cation of spatial operators (i.e., spatial relations or geometric operations) as found in current spatial DBMSs 
II23I . Following current implementations of DBMSs, regions could be defined as finite sets of polygons that, 
in their turn, are defined through their vertices. This would make regions finitely representable. However, in 
this work geometries will be treated at a more abstract level, which is independent of the spatial model used 
for geometric representation. In consequence, an admissible geometry of the Euclidean plane is either the 
empty geometry, g^, which corresponds to the empty subset of the plane, or is a closed and bounded region 
with a positive area. It holds g^ D g = g D g^ = g^, ior every region g. From now on, empty geometries 
and regions of are called admissible geometries and they form the class Ad. 

Geometric attributes are complex data types, and their manipulation may have an important effect on 
the computational cost of certain algorithms and algorithmic problems. As usual, we are interested in data 

^For simplicity, we use one spatial attribute, but it is not difficult to consider a greater number of spatial attributes. 
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complexity, i.e., in terms of the size of the database. The size of a spatio-relational database can be defined 
as a function of the number of tuples and the representation size of geometries in those tuples. 

We concentrate on binary (i.e., two-ary) spatial predicates that represent topological relations between 
regions. They have a fixed semantics, and become the elements of T. There are eight basic binary rela- 
tions over regions of M^: Overlaps (OV), Equals (EQ), CoveredBy {CB), Inside (IS), Covers (CV), 
Includes {IC), Touches {TO), and Disjoint {DJ) |[T21[26lIl The semantics of the topological relations 
follows the point-set topology defined in [15], which is not defined for empty geometries. We will apply 
this semantics to our non-empty admissible geometries. For the case of the empty set, a separate definition 
will be given below. According to ifTSl . an atom T(x, y) becomes true if four conditions are simultaneously 
true. Those conditions are expressed in terms of emptyness (0) and non-emptyness (^0) of the intersection 
of their boundaries {5) and interiors (o). The definitions can be found in Table [H For example, for non- 
empty regions TO{x^y) is true iff all of 5{x) H 5{y) ^ 0, o(x) H o{y) = 0, 5{x) H o(^) = 0, and 
o(x) n 5{y) = simultaneously hold. 

Relation 5{x) n 5{y) o(x) n o(^) 5{x) n o{y) o(x) n 5{y) 



DJ(x,y) 

TO(x,y) ^0 

EQ(x,y) -0-0 

IS(x,y) 0-0-0 

CB(x,y) -0 -0 -0 

IC(x,y) 0-0 0-0 

CV(x,y) -0 -0 -0 

OV(x,y) -0 -0 -0 -0 



Table 1: Definition of topological relations between regions based on point-set topology 

In this work we exclude the topological relation Disjoint from T. This decision is discussed in Section [51 
where we introduce the repair semantics. In addition to the basic topological relations, we consider three 
derived relations that exist in current SQL languages and can be logically defined in terms of the other 
basic predicates: Intersects {IT), Within { WI), and Contains {CO). We also introduce a forth relation, 
Ilnter sects (II), that holds when the interiors of two geometries intersect. It can be logically defined as 
the disjunction of Overlaps, Within and Contains (cf. Figure |2]). For all the topological relations in T, 
their converse (inverse) relation is within the set. Some of them are symmetric, like Equals, Touches, and 
Overlaps. For the non- symmetric relations, the converse relation of CoveredBy is Covers, of Inside is 
Includes, and of Within is Contains. 

As mentioned before, the formal definitions of the topological relations (TSl [26l do not consider the 
empty geometry as an argument. Indeed, at the best of our knowledge, no clear semantics for topological 
predicates with empty geometries exists. However, in our case we extent the definitions in order to deal 
with this case. This will allow us to use a classical bi-valued logic, where atoms are always true or false, 
but never undefined. According to our extended definition, for any T E T, T(^i, ^2) is false if gi = or 
92 = 90' In particular, IS{g^ ^0) is false, for every admissible region g. In order to make comparisons with 
the empty region, we will introduce and use a special predicate IsEmpty{-) on admissible geometries, such 

^The names of relations chosen here are in agreement with the names used in current SQL languages [23 1, but differ slightly 
from the names found in the research literature. The relations found in current SQL languages are represented in Figure |2] with 
thick borders. 
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Figure 2: Subsumption lattice of topological relations between regions: OV (Overlaps), CB (CoveredBy), 
IS (Inside), EQ (Equals), CV (Covers), IC (Includes), TO (Touches), DJ (Disjoint), IT (Intersects), II (Iln- 
tersects), WI (Within), and CO (Contains). 



that IsEmpty{s) is true iff s = ^q. 

Notice that the semantics of the topological predicates, even for non-empty regions, may differ from the 
intuitive set-theoretic semantics one could assign to them. For example, for an admissible and non-empty 
geometry g, OV{g^g) is false (due to the conditions in the last two columns in Table [T]). In consequence, 
the constraint VxV5^^0^(i?(x; s) A OV{s, s)) is satisfied. 

Given a database instance, additional spatial information is usually computed from the explicit geometric 
data by means of the spatial operators in O associated with S. Some relevant operators are: Area, Union 
(binary). Intersection, Difference, Buffer, and Union Aggregation {GeomUnion)^ (Cf. |[23]| for the 
complete set of spatial predicates defined within the Open GIS Consortium.) There are several spatial 
operators used in this work; however, we will identify a particular subset of spatial operators in O, i.e., 

C O, which will be defined for all admissible geometries and used to shrink geometries with the purpose 
of restoring consistency, as we describe in Section |3] 

Definition 1 The set of admissible operations contains the following geometric operations on admissi- 
ble geometries g and g': 

(1) Difference{g ^ g') is the topological closure of the set-difference. 

(2) Buffer {g^ d) is the geometry obtained by buffering a distance d around g, where d is a distance unit. 
Buffer {g^ d) returns a closed region g containing geometry g, such that every point in the boundary of g is 
at a distance d from some point of the boundary of g. In particular. Buffer {g^, d) = g^. □ 

Notice that these operators, when applied to admissible geometries, produce admissible geometries. 

Remark 1 The value of d in Definition [T] is instance dependent. It should be precomputed from the spatial 
input data. For this work, we consider d to be a fixed value associated with the minimum distance between 
geometries in the cartographic scale of the database instance. □ 

^Operator GeomUnion returns the geometry that represents the point set union of all geometries in a given set, an operator 
also known as a spatial aggregation operator. Although this function is part of SQL for several spatial databases (Postgres/PostGIS, 
Oracle), it is not explicitly defined in the OGC specification 1231 . 
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Figure 3: A spatial database instance 



A schema H determines a many-sorted, first-order (FO) language C{T>) of predicate logic. It can be used 
to syntactically characterize and express SICs. For simplicity, we concentrate on denial SICs^ which are 
sentences of the form: 

m n 

V^^0 5Vx-(/\i?,(x,;5,) A(/pA /\Tj{vj,Wj)). (2) 

i=i j=i 

Here, s = si • • • s^, x = xi • • • are finite sequences of geometric and thematic variables, respectively, 
and < m, n G N. Thus, each Xi is a finite tuple of thematic variables and will be treated as a set of 
attributes, such that Xi C xj means that the variables in Xi area also variables in xj. Also, Vx stands for 
Vxi • • • Vx^; and V^^^ 5 stands for Vsi • • • Vs^, with the universal quantifiers ranging over all the non-empty 
admissible geometries (i.e. regions). Here, vj.wj G 5, . . . , i?^ G 7^, (/^ is a formula containing built-in 
atoms over thematic attributes, and Tj E T. A constraint of the form ^ prohibits certain combinations of 
database atoms. Since topological predicates for empty geometries are always false, the restricted quantifi- 
cation over non-empty geometries in the constraints could be eliminated. However, we do not want to make 
the satisfaction of the constraints rely on our particular definition of the topological predicates for the empty 
region. In this way, our framework becomes more general, robust and modular, in the sense that it would 
be possible to redefine the topological predicates for the empty region without affecting our approach and 
results. 

Example 2 Figure [3] shows an instance for the schema TZ = {LandP{idl, name^ owner; geometry). 
Building (idb; geometry)}. Dark rectangles represent buildings and white rectangles represents land parcels. 
In LandP, the thematic attributes are idl, name and owner, whereas geometry is the spatial attribute of 
dimension 2. Similarly for Building, which has only idl as a thematic attribute. 

The following sentences are denial SICs: (The symbol V stands for the universal closure of the formula 
that follows it.) 

y^{LandP{idli,ni, oi; si) A LandP{idl2, ^2, 02; 52) A idli ^ idl2 A IIntersects{si, 82))- (3) 

y^{Building{idb] si) A LandP{idl,n,o] S2) A Overlaps {si, 82))- (4) 

The SIC ([3]) says that geometries of land parcels with different ids cannot internally intersect (i.e., they 
can only be disjoint or touch). The SIC (|4]) establishes that building blocks cannot (partially) overlap land 
parcels. □ 



"^Denial constraints are easier to handle in the relational case as consistency with respect to them is achieved by tuple deletions 
only 
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A database instance D for schema H can be seen as an interpretation structure for the language C{T^). For 
a set of SICs in D \= denotes that each of the constraints in is true in (or satisfied by) D. In 

this case, we say that D is consistent with respect to Correspondingly, D is inconsistent with respect to 

denoted ^ when there is a ^ G that is violated by D, i.e., not satisfied by D. The instance in 
Example [21 is consistent with respect to its SICs. 

In what follows, we will assume that the set of SICs under consideration is logically consistent; i.e., 
that there exists a non-empty database instance D (not necessarily the one at hand), such that D \^ "if. For 
example, any set of SICs containing a constraint of the form M^^^sMx -^{R{x; s) A Equals{s^ s)) is logically 
inconsistent. The analysis of whether a set of SICs is logically consistent or not is out of the scope of this 
work. 

3 A Repair Semantics 

Different alternatives for update-based consistency restoration of spatial databases are discussed in ll28l . One 
of the key criteria to decide about the update to apply is minimality of geometric changes. Another important 
criteria may be the semantics of spatial objects, which makes changes over the geometry of one type of object 
more appropriate than others. For this work, the repair semantics is a rule applied automatically. It assumes 
that no previous knowledge about the quality and relevance of geometries exists and, therefore, it assumes 
that geometries are all equally important. 

On the basis on the minimality condition on geometric changes and the monotonicity property of some 
topological predicates ll28]| . we propose to solve inconsistencies with respect to SICs of the form ^ through 
shrinking of geometries. Notice that this repair semantics will be used as an instrumental concept to formal- 
ize consistent query answers (no actual modification over the database occurs). As such, it defines what part 
of the geometry is not in conflict with respect to a set of integrity constraints and can, therefore, be part of a 
consistent answer. 

Shrinking geometries eliminates conflicting parts of geometries without adding new uncertain geome- 
tries by enlargement. In this way, we are considering a proper subset of the possible changes to fix spatial 
databases proposed in [|29]| . We disregard translating objects, because they will carry potentially new con- 
flicts; and also creating new objects (object splitting), because we would have to deal with null or unknown 
thematic attributes. 

The SICs of the form ^ exclude the topological predicate Disjoint. The reason is that falsifying an 
atom DJ{gi^g2) by shrinking geometries is not possible, unless we make one of them empty. However, 
doing so would heavily depend upon our definition of this topological predicate for empty regions. Since 
we opted for not making our approach and results depend on this particular definition, we prefer to exclude 
the Disjoint predicate from our considerations. The study of other repair semantics that sensibly includes 
the topological predicate Disjoint will be left for future work. 

Technically, a database D violates a constraint \lxiX2^^^^ siS2 si) A i?2(^2;52) A (/^ A 

T{si^S2)), with T G tH when there are data values ai,a2,^i,^2, with ^1,^2 7^ ^0, for the variables 
in the constraint such that si) Ai?2(^2; 52) A (/^ A T(5i, 52)) becomes true in the database under 

those values. This is denoted with D ^ si) A i?2(^2; S2) f\ T{si^ S2)) [^i, a2, ^1, ^2]- When 

this is the case, it is possible to restore consistency of D by shrinking gi or ^2 such that T(^i, ^2) becomes 
false. 

^For simplicity and without lost of generality, in the examples we consider denial constraints with at most two spatio-relational 
predicates and one topological predicate. However, a denial constraint of the form (|2} may have more spatio-relational predicates 
and topological predicates. 
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We can compare geometries, usually an original geometry and its shrunk version, by means of a distance 
function that refers to their areas. We assume that area G O is an operator that computes the area of a 
geometry. 

Definition 2 For regions ^1,^2, ^(^1,^2) = area{Difference{gi, g2) U Difference{g2, gi)). □ 

Since we will compare a region gi with a region ^2 obtained by shrinking gi, it will hold ^2) ^ 0. In- 
deed, when comparing ^2 C the distance function can be simplified by 5(^1,^2) = area{Difference{gi, g2)). 
We will assume that it is possible to compare geometries through the distance function by correlating their 
tuples, one by one. This requires a correspondence between instances. 

Definition 3 Let D' be database instances of schema H. D' is (D, fD')-indexed if f^' is a bijective 
function from D to D' , such that, for all ci, . . . , c^, s: fD'{R{ci^ ■ ■ ■ ^Cn] s)) = i?(ci, . . . , c^; s'), for some 
region s' . □ 

In a {D^ /|^/)-indexed instance D' we can compare tuples one by one with their counterparts in instance D. 
In particular, we can see how the geometric attribute values differ. In some cases there is an obvious function 
fo', for example, when there is a key from a subset of A to the spatial attribute S, or when relations have a 
surrogate key for identification of tuples. In these cases we simply use the notion of D-indexed. When the 
context is clear, we also use / instead of f^'- 

Examples (example [2] cont.) Consider the relational schema LandP{idl^ name^ owner; geometry). For 
the instance D given in Example [2l the following instance is (D,/) -indexed 
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Here, f {LandP (idli^ni^ CLi'-,gi)) = LandP (idli^ni^ ai; ^7), etc. □ 

When restoring consistency, it may be necessary to consider different combinations of tuples and SICs. 
Eventually, we should obtain a new instance, hopefully consistent, that we have to compare to the original 
instance in terms of their distance. 

Definition 4 Let D^D'hQ spatial database instances over the same schema H, with D' {D^ /) -indexed. The 

distance A{D,D^) between D and is the numerical value A{D, D') = S^-^2^5(n^(f), n^(/(f))), 
where lisip) is the projection of tuple i on its spatial attribute S. □ 

Now it is possible to define a "repair semantics", which is independent of the geometric operators used to 
shrink geometries. 

Definition 5 Let D be a spatial database instance over schema S, a set of SICs, such that D ^ (a) 
An s-repair of D with respect to is a database instance D' over S, such that: (i) D' |= (ii) D' is 
{D, /)-indexed. (iii) For every tuple i?(ci, . . . ,Cn;g) e D, if /(i?(ci, . . . , c^; g)) = i?(ci, . . . , c^; g'), then 
9^ ^ 9' (b) A minimal s-repair D' of D is a repair of D such that, for every repair D" of i?, it holds 
A{D,D') > A{D,D'). □ 

stands for geometric inclusion 
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Proposition 1 If is consistent with respect to then D is also its only minimal s-repair. 

Proof: For = D, it holds: (i) ^ (ii) is {D, /)-indexed, (iii) for every tuple i?(ci, . . . , c^; ^) G 
i^, if /(i?(ci, . . . , c^; ^)) = i?(ci, . . . , c^; then = g. In this case, A(i?, i?') = 0. Any other con- 
sistent instance D^^ obtained by shrinking any of D's geometries and still obtaining admissible geometries 
gives A(i?,i?'0 > 0. □ 

This is an "ideal and natural" repair semantics that defines a collection of semantic repairs. The defini- 
tion is purely set-theoretic and topological in essence. It is worth exploring the properties of this semantics 
and its impact on properties of consistent query answers (as invariant under minimal s-repairs) and on log- 
ical reasoning about them. However, for a given database instance we may have a continuum and infinite 
number of s-repairs since between two points we have an infinite number of points, which we want to avoid 
for representational and computational reasons. 

In this work we will consider an alternative repair semantics that is more operational in nature (cf. 
Definition [8]), leaving the previous one for reference. This operational definition of repair makes it possible 
to deal with repairs in current spatial DBMSs and in terms of standard geometric operators (cf. Lemma [T). 
Under this definition, there will always be a finite number of repairs for a given instance. Consistency will 
be restored by applying a finite sequence of admissible transformation operations to conflicting geometries. 

It is easy to see that each true relationship (atom) of the form T(^i, ^2), with T E T, can be falsified 
by applying an admissible transformation in to gi or ^2- Actually, they can be falsified in a canonical 
way. These canonical falsification operations for the different topological atoms are presented in Table |2] 
They have the advantages of: (a) being defined in terms of the admissible operators, (b) capturing the repair 
process in terms of the elimination of conflicting parts of geometries, and (c) changing one of the geometries 
participating in a conflict. 

More specifically, in Table [2] we indicate, for each relation T G T, alternative operations that falsify 
a true atom of the form T{gi^g2). Each of them makes changes on one of the geometries, leaving the 
other geometry unchanged. The list of canonical transformations in this table prescribes particular ways of 
applying the admissible operators of Definition [T] Later on, they will also become the admissible or legal 
ways of transforming geometries with the purpose of restoring consistency. 

For example. Table |2] shows that for Overlaps {OV), there are in principle four ways to make false an 
atom Overlaps (gi ^ g2) that is true. These are the alternatives L to 4. in that entry, where alternatives L and 
2. change geometry gi, and alternatives 3. and 4. change geometry ^2- Only one of these alternatives that 
satisfies its condition is expected to be chosen to falsify the atom. A minimal way to change a geometry 
depends on the relative size between overlapping and non-overlapping areas: (i) when the overlapping area 
between gi and ^2 is smaller than or equal to their non-overlapping areas, a minimal change over geometry 
gi is Difference {gi ^ g2), and over ^2 is Difference {g2 ^ gi) (cases L and 3. for OV in TableO. (ii) When the 
non-overlapping areas of gi or ^2 are smaller than the overlapping area, a minimal change over geometry gi 
is Difference{gi^ Difference (gi ^ g2)), and over geometry g2 is Difference(g2^ Difference (g2 ^ gi)) (cases 
2. and 4. for in Table [2]) . 

For the case when Equals{gi , ^2) is true, the transformations in Table |2]make either geometry, gi or g2 
empty to falsify the atom. However, there are other alternatives that by shrinking geometries would achieve 
the same result, but also producing smaller changes in terms of the affected area. A natural candidate update 
consists in applying the transformation g'^ = Difference{gi^Buffer{Boundary{g2),d)) (similarly and alter- 
natively for g2). In this case, we just take away from gi the part of the internal area of width d surrounding 
the boundary of gi, to make it different from ^2- We did not follow this alternative for practical reasons: 
having two geometries that are topologically equal could, in many cases, be the result of duplicate data, and 
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Pred. T 


A trae atom T{gi,g2) becomes a false atom T{g[,g2) with 


OV 


1. If area{gi Ci g2) < area{gi \ g2): 
9i = Difference{gi,g2), 92 = 92- 

91 = Difference {gi, Difference{gi, g2)). 92 = 92- 

3. If area{gi H ^2) < area(g2 \ ^1): 

92 = Difference{g2,gi), g[ = gi- 

4. If area{gi H ^2) > area{g2 \ ^1): 

^2 = Difference{g2, Difference{g2, gi)), g[ = gi- 


IS, CB 


1. If area(^i n ^2) < area{g2 \ ^1): 
^2 = Difference{g2,gi), g[ = gi- 

2. If area(^i H ^2) > area{g2 \ ^1): 

^2 = Difference {g2, Difference{g2, gi)). 9i= 91- 

3. ^1 = Difference(gi,g2), ^2 = ^2- 


IC, CV 


1. If area{gi H ^2) ^ area{gi \ ^2)- 
^1 = Difference{gi,g2), ^2 = ^2- 

2. If areaiqi H 02) > areaiqi \ (72): 

^1 = Difference{gi, Difference{gi, g2)). 92 = 92- 

3. = Difference{g2,gi), g[ = ^1. 


II, WI, CO 


1. gi = Differenceigi, g2), go = ^2- 

2. ^2 = Difference(g2,gi), g[ = gi- 


TO, IT 


1. — Differenceiq^ , buffer (qo, d)), qU — qo. 

2. ^2 = Difference {g2, buffer{gi,d)), g[ = gi. 
(See Remark [T] for definition of d) 


EQ 


^'9\ ^9(Z)^92 = 92- 
2. 92 = 9(Z)^9[ =91' 



Table 2: Admissible transformations 



one of them should be eliminated. Moreover, this alternative, in comparison with the officially adopted in 
this work, may create new conflicts with respect to other SICs. Avoiding them whenever possible will be 
used later, when designing a polynomial algorithm for CQA based on the core of an inconsistent database 
instance (see Section [5]). 

Table [2] shows that Touches and Intersects are predicates for which the eliminated area is not completely 
delimited by the real boundary of objects. Actually, we need to separate the touching boundaries. We do so 
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by buffering a distance d around one of the geometries and taking the overlapping part from the other onejj 
The following result is obtained directly from Table |2] 

Lemma 1 For each topological predicate T G T and true ground atom T{gi,g2), there are geometries 
^^,^2 obtained by means of the corresponding admissible transformation in Tabled such that T{g[^g2) 
becomes false. □ 

The following definition defines, for each geometric predicate T, a binary geometric operator tr^ such that, 
if r(^i, ^2) is true, then tr^(^i, ^2) returns a geometry g[ such that T{g[,g2) becomes false. The definition 
is based on the transformations that affect geometry gi in Tabled 

Definition 6 Let T G T be a topological predicate. We define an admissible transformation operator tr^ : 
Ad X Ad ^ Ad as follows: 

(a) If T{gi,g2) is false, then tr^(^i, ^2) := ^i- 

(b) If T(^i, ^2) is true, then: 

f Difference{gi,g2) if area{gi H ^2) < area{gi \ ^2) 
[ Difference [gi , Difference (^1 , ^2 ) ) , otherwise 
for Te {OV.IC, CV}] 

Difference{gi,g2), forT G {IS,CBJI,WI,CO}; 
Difference{gi, Buffer{g2, d)) for T G {TO, IT}; 

g^forTe{EQ}. □ 



tr^ {91^92) := 

tr^{9i,92) := 

tr^{9i,92) := 

tr^{9i,92) := 



It can be easily verified that the admissible operations tr^ , applied to admissible geometries, produce ad- 
missible geometries. They can be seen as macros defined in terms of the basic operations in Definition [H 
and inspired by Table 121 The idea is that the operator tr^ takes (^1,^2), for which T{gi^g2) is true, and 
makes the latter false by transforming gi into g[, i.e., T{g[^g2) becomes false. 

Definition[6]can also be used to formalize the transformations on geometry g2 indicated in Tabled First, 
notice that for the converse predicate T"^ of predicate T it holds: T^{gi^g2) true iff T(g2^gi). Secondly, the 
converse of a transformation operator can be defined by {tr^Y := tr^^''\ In consequence, we can apply 
tr^"" to {92^91), obtaining the desired transformation of geometry g2. In this way, all the cases in Table [2l 
are covered. For example, if we want to make false a true atom Inside {gi ^ g2), we can apply tr^^{gi^ g2), 
but also tr^^{g2^ 9i)' 

Example 4 Table [3] illustrates the application of the admissible transformations to restore consistency of 
predicates T G {Overlaps, Touches}. The dashed boundary is the result of applying Buffer{g, d). □ 

We now define the notion of accessible instance that results from an original instance, after applying admis- 
sible transformation operations to geometries. The application of sequences of operators solves violations 
of SICs. Accordingly, the accessible instances are defined by induction. 

^The buffer operator does not introduce new points in the geometric representation of objects, but it translates the boundary a 
distance d outwards. 
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Table 3: Examples of admissible transformations 



Definition 7 Let be a database instance. is an accessible instance from (with respect to a finite set 
of SICs ^^), if i?' is obtained after applying, a finite number of times, the following inductive rules (any of 
them, when applicable): 

(1) . = D. 

(2) . There is an accessible instance Dq from D, such that, for some ^ G with a topological predicate T, 
£^0 ^ ^through tuples Ri{ai;gi) and i?2(^2; ^2) in i?o. for which r(^i, ^2) is true; and 

(a) = Do\ {Ri{di,gi)} U {i?i(ai, tr^(^i, ^2))}, or 

(b) i?^ = i?o \ {i?2(a2,^2)} U {i?2(a2, tr^I^2,^i))}. □ 

Example 5 Consider the database instance in Figure Ua) that is inconsistent with respect to SIC (|3]). An 
accessible instance from this inconsistent database is in Figure Sfb), where only gi has changed. This can 
be expressed in the following way: LandP{idli^ni^oi; g[) = Lan(iP(z(i/i, ni, oi; tr^^(tr^^(^i, ^2), ^'s))- 
□ 

Given a database D, possibly inconsistent, we are interested in those accessible instances that are consis- 
tent, i.e., ^ Even more, having the repairs in mind, we have to make sure that admissible instances 
from D can still be indexed with D. 

Proposition 2 Let be an accessible instance from D. Then, is /-indexed to D via an index function 
/, that can be defined by induction on D\ 

Proof: To simplify the presentation, we will assume that D has an index (or surrogate key) io, that is a 
one-to-one mapping from D to an initial segment [1, A^] of N. Let be an accessible instance from D. We 
define iD'{R{a] g)) G N for tuples in D' by induction on D': 

(1) . IfD'^D and i?(a; g) G iD'{R{a- g)) = io{R{a- g)). 

(2) . If there is an accessible instance Dq from D and ^ ^ ^ ^ through the atoms i?i(ai; ^1), i?2(^2; ^2), 
and T(^i, ^2) with T and the converse relation of T: 

may have more than one topological predicate. 
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(a) i?' = i?o \ {Ri{ai,gi)} U {i?i(ai, tr^(^i, ^2))}, and Zi^/(i?i(ai; tr^(^i, ^2))) = iDo{Ri{o^ii 9i)) 
and iD'{R2{a2]g2)) = ^1^0(^2(^2,^2)), or 

(b) D' = Do\ {i?2(tt2,^2)} U {i?2(tt2,tr^'(^2,^i))}, and iD'{Ri{ai; gi)) = iDo{Ri{ai, gi)) and 

iD'{R2{a2]tr^\g2^gi))) = ^L>o(^2(tt2,^2)). □ 



Any two accessible instances D' and can be indexed via in a natural way, and thus, they can be com- 
pared tuple by tuple. In the following, we will assume, when comparing any two accessible instances in this 
way, that there is such an underlying index function /. Now we give the definition of operational repair. 

Definition 8 Let D be an instance over schema H and a finite set of SICs. (a) An o-repair of D with 
respect to is an instance D' that is accessible from D, such that D' ^ (b) A minimal o-repair D' of D 
is an o-repair of D such that, for every o-repair D" of D, A{D^ D") > A{D^ D'). (c) Rep{D^ ^^) denotes 
the set of minimal o-repairs of D with respect to □ 

The distances A(i?, V) and A(i?, D') in this definition are relative to the corresponding index functions, 
whose existence is guaranteed by Proposition |2l Unless otherwise stated, this is the repair semantics we 
refer to in the remainder of the paper, in particular, in the definition of consistent query answer in Section 
m In consequence, in the following a repair is an o-repair, and the same applies to minimal repairs. Even 
more, whenever we refer to repairs, we should understand that minimal repairs are intended. 

Example 6 Consider database schema in Example [21 The instance D in Figure |5]is inconsistent with respect 
to the SICs ([3]) and dJ)), because the land parcels with geometries ^2 and gs overlap, and so do the land parcels 
with geometries ^2 and ^4. Likewise, buildings with geometry g^ and gQ partially overlap land parcels with 
geometries gi and g2, respectively. 
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Figure 5: Inconsistent database instance 
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Figure 6: Minimal repairs 



Figure [6] shows the two minimal repairs of D. In them, the regions with thicker boundaries are the re- 
gions that have their geometries changed. For the minimal repair in Figure |6t a), the inconsistency involving 
geometries 92 and ^3 is repaired by applying Dijference{92^ 93) to 92, i.e., removing from 92 the whole 
overlapping geometry, and keeping the geometry of ^3 as originally. Notice that due to the interaction be- 
tween integrity constraints, if we apply Difference{93 ^ 92) to ^3, i.e., we remove the whole overlapping 
area from ^3, we still have an inconsistency, because the building with geometry will continue partially 
overlapping geometries 92 and ^3. Thus, this change will require an additional transformation to ensure that 
9q is completely covered or inside of ^3. 

In the same minimal repair (Figure Ha)), the inconsistency between 92 and 94 is repaired by shrinking 
92, eliminating its area that overlaps ^4. This is obtained by applying Dijference(92^ 94) to 92. Finally, 
the inconsistency between 91 and 9^ is repaired by removing from 9^ its part that does not overlap with 
geometry 91. In principle, we could have repaired this inconsistency by eliminating the overlapping region 
between 91 and ^5, but this is not a minimal change. 

In the second minimal repair (Figure Hb)), geometries 92 and 9^ undergo the same changes than those 
in the first minimal repair (Figure IQa)), but the inconsistency between 92 and 94 is restored by eliminating 
geometry 94, i.e., applying Difference (^4, ^2) = ^0- □ 

Notice that, by applying admissible transformation operators to restore consistency, the whole part of a 
geometry that is in conflict with respect to another geometry is removed. In consequence, given that there 
are finitely many geometries in the database instance and finitely many SICs, a finite number of applications 
of admissible transformations are sufficient to restore consistency. This contrasts with the s-repair semantics, 
which can yield even a continuum of possible consistency-restoration transformations. Keeping the number 
of repairs finite may be crucial for certain mechanisms for computing consistent query answers, as those 
as we will show in the next sections. Actually, we will use existing geometric operators as implemented in 
spatial DBMSs in order to capture and compute the consistency-restoring geometric transformations. This 
will be eventually used to obtain consistent query answers for an interesting class of spatial queries and SICs 
in Section [521 
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Figure 7: An inconsistent database instance 
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Figure 8: Minimal repairs: (a) minimal s-repair and (b) minimal o-repairs (thick boundaries show geometries 
that have changed) 

Despite the advantages of using o-repairs, the following example shows that an o-repair may not be 
minimal under the s-repair semantics. 

Example 7 The instance D in Figure |7]is inconsistent with respect to the SICs ^ and (|4]), because the land 
parcels with geometries gi and 92 internally intersect and buildings with geometry g^ and g^ overlap land 
parcels with geometries gi and g2, respectively. 

Figures (Ha) and (b) show the minimal s-repair (Definition |5]) and o-repairs of D (Definition [8]), respec- 
tively. In them, the regions with thicker boundaries are the regions that had their geometries changed. Here, 
by applying s-repair semantics we obtain one minimal repair (Figure (Ha)) that takes the partial conflicting 
parts from both land parcels gi and ^2 in conflict, and leave unchanged the geometries of buildings gs and 
^4. Instead, for the o-repair semantics, each repair takes the whole conflicting parts from one of the land 
parcels gi or ^2 in order to satisfy SIC (|3]), and to satisfy SIC dU, each repair eliminates the conflict between 
the new version of gi and building gs or between the new version of ^2 and building ^4. This makes up to 
four possible o-repairs (Figure |8lb)), which are not minimal with respect to the single s-repair. □ 

S -repairs may take away only parts of a geometry that participate in a conflict. On the other side, they do not 
force a conflicting geometry to become empty in cases where o-repairs would do so. For instance, consider 
a true atom Equals{gi^g2) that has to be falsified. A s-repair can be obtained by shrinking one of the two 
geometries just a little, without making it empty. However, by using admissible transformations, we can 
only falsify this atom by making one of the geometries empty. In this case, a minimal o-repair is not a 
minimal s-repair. 

Proposition 3 Let be a database instance, and a set of SICs. Then the following properties for o- 
repairs hold: (a) If D is consistent with respect to then D is its only minimal o-repair. (b) \f D' is an 
{D^ /)-indexed o-repair of D and f(R{a; g)) = i?(a; g'), then g' C g. (c) The set of o-repair for D is finite 
and non-empty. 

Proof: (a) By the inductive definition of o-repair, an admissible transformation operator is applied to a 
geometry g when g is in conflict with other geometry g' 'mD. Since a consistent database instance does not 
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Figure 9: Example of a region's boundary after geometric transformation 



contain conflicting tuples, none of the transformations operators is applicable and the consistent database 
instance is its only o-repair. 

(b) The application of each admissible transformation tr^{gi^g2), with T eT, has five possible outcomes: 

gi,g(Z), Dijference{gi,g2), Difference (gi, Difference{gi, ^2)), or Dijference{gi, Buffer {g2, d)). Then, by 
definition of operator Difference (cf. Definition [T]), tr^{gi^g2) ^ gi- 

(c) D has a finite number N of tuples; and a finite number of integrity constraints. In consequence, there is 
a finite number of conflicts, i.e., sets of tuples that simultaneously participate in the violation of one element 
^ of via their geometries. Each of these conflicts are solved by shrinking some of those geometries. 
Each application of an operator O, chosen for a finite set of them, according to the inductive definition of 
o-repair solves an existing conflict by falsifying at least one of the T- atoms in a ground instance of ^. In 
principle, the application of such an operator O may produce new conflicts; however, it strictly decreases 
the total geometrical area of the database instance. More precisely, if A{D') := ^R(t;g)eD'(^^^(^(9)^ then 
A{D^) > A(0{D')), where 0{D') is the instance resulting from the conflict resolving operator O to D' . In 
particular, is the area A{D) of the original instance D. 

Now we reason by induction on the structure of o-repairs. The application of a one-conflict solving 
operator O to an instance D^-i produces an instance Dn with A{Dn) < A{Dn-i). Moreover, A{Dn-i) — 
A{Dn) > e > 0, where e represents a lower bound of the area reduction at each inductive step. 

We claim that, due to our repair semantics, this lower bound e depends on the initial instance D, and 
not on n. In order to prove this, let us first remark that an admissible region is fully determined by its 
boundaries. Now we prove that the regions in any accessible instance depend on the regions in the original 
database instance, or, more precisely, by the boundaries delimiting those regions. We prove it by induction 
on the number of inductive steps of the definition of accessible instances. 

First, we prove that it works for the first repair transformation on the original database instance. Let 
g[ = Tr^(^i, ^2) be the first transformation applied on region gi to create the accessible instance Di from 
the original database instance D. For T G T \ {TO, IT}, and following the definitions of admissible 
transformations in Table |2l the geometry g[ is either g^ or a region whose boundary is formed by parts 
of the boundaries that limit regions gi and g2 (see example of overlapping regions in Figure |9]). For T G 
{TO, IT}, g[ is formed by parts of the boundary of region gi and the boundary created by buffering d 
around ^2- So, in this case, the boundary of g[ depends exclusively on the boundaries of gi and g2, and of 
the constant d. 

Let assume that the geometries in an accessible instance Dn obtained after n inductive steps are g^ or 
regions whose boundaries depend on the original instance. In the n + 1 inductive step, another transforma- 
tion g'- = Tr^ {gi^gj) is applied. Following the definition of admissible transformations, g[ becomes the 
geometry g^ or a region whose boundary is formed by part of the boundary of gi and part of the boundary 
of gj, as in the first inductive step. Thus, g[ also depends on the original database instance. This establishes 
our claim. 
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Now, by the Archimedean property of real numbers, there is a number M such that — Me < 0. Thus, 
after a finite number of iterations (i.e. applications of conflict-solving operators), we reach a consistent 
instance or an instance with area 0, i.e. all of whose geometries are empty, which would be consistent too. 

Notice that the number M provides an upper bound on the number of times we can apply operators to 
produce a repair. At each point we have a finite number of choices. So, the overall number of o-repairs that 
can be produced is finite. □ 



The following example shows that, even when applying admissible transformations, there may be expo- 
nentially many minimal repairs in the size of the database, a phenomenon already observed with relational 
repairs with respect to functional dependencies [|6|. 

Example 8 Consider the schema in Example |2l and the SIC (|3]). The database instance contains n spatial 
tuples, as shown in Figure [TOl There are n — 1 overlappings and n overlapping geometries. 
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Figure 10: Exponential number of repairs 



In order to solve each of those overlaps, we have the options of shrinking either one of the two regions 
involved. We have 2^~^ possible minimal repairs. □ 

The following remark is important when estimating the data complexity of repairs, because, in this case, data 
complexity does not only depend on the number of tuples, but also on the size of geometric representations. 

Remark 2 Transformation operators that make geometries empty reduce the size of geometric representa- 
tions. Any other admissible transformation operator tr^(^i, ^2) shrinks gi, and uses gi and ^2 to define 
the new boundary of gi. Thus, we are using, in a simple manner, the original geometric representation (e.g. 
points in the boundaries of the original geometries) to define a new geometry. It is clearly the case that there 
is a polynomial upper bound on the size of the representation of a new geometry in an o-repair in terms of 
the size of the original database, including representations of geometric regions. □ 



4 Consistent Query Answers 

We can use the concept of minimal repairs as an auxiliary concept to define, and possibly compute, consistent 
answers to a relevant class of queries in 

A general conjunctive query is of the form: 

)A(/p), (5) 

where v = (Ui(^i ^ {^i})) ^ V the free variables, and (fissi conjunction of built-in atoms over thematic 
attributes or over spatial attributes that involve topological predicates in T and geometric operators in O. 
We also add a safety condition, requiring that variables in (f also appear in some of the Ri. For example, the 
following is a conjunctive query: 

Q(x, y] s) : 3siS2{R{x] si) A R{y] S2) A Inters ects{si, S2) A x ^ y A s — Difference{si, 52)). 
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We will consider the simpler but common and relevant class of conjunctive queries that are operator free, 
i.e., conjunctive queries of the form (|5]) where Lp does not contain geometric operators. We will also study 
in more detail two particular classes of conjunctive queries: 

(a) Spatial range queries are of the form 

Q{u', s) : 3z{R{x', s) A T{s, w)), (6) 

with w a spatial constant, and z C x. This is a "query window", and its free variables are those in 
u — {x \z) ox in {5}. 

(b) Spatial join queries are of the form 

Q{u] 51, 52) : 5i) A i?2(^2; ^2) A T{si, 52)), (7) 

with T eT, and z C xiU X2. The free variables are those inu = ((xi U X2) \ z) or in {51, 52}. 
We call basic conjunctive queries to queries of the form (|6]) or (|7} with T G {Ilntersects, Intersects}. 

Remark 3 Notice that for these two classes of queries we project on all the geometric attributes. We will 
also assume that the free variables correspond to a set of attributes of R with its key of the form ([T). More 
precisely, for range queries, the attributes associated with u contain the key of R. For join queries, uD xi 
and ^xnx2 contain the key for relations Ri, R2, respectively. This is a common situation in spatial databases, 
where a geometry is retrieved together with its key values. □ 

Given a query Q{x;s), with free thematic variables x and free geometric variables s, a sequence of the- 
matic/spatial constants (c; g) is an answer to the query in instance D if and only if D Q(c; g), that is the 
query Q becomes true in D as a formula when its free variables x, 5 are replaced by the constants in c, g, 
respectively. We denote with Q{D) the set of answers to Q in instance D. 

Example 9 Figure [U] shows an instance for the schema 71 = {LandP{idl; geometry), Building(idb; 
geometry)}. Here, idl, idb are keys for their relations. Dark rectangles represent buildings, and white 
rectangles represent land parcels. The queries Qi and Q2 below are a range and a join query, respectively. 
For the former, the spatial constant is the spatial window shown in Figure [U] namely the (closed) polygon 
obtained by joining the four points in order indicated in the query. 

Qi(idb;g) : Building (idb; g) A 

Intersects{g, ([xi,yi], [x2,^i], [x2,y2], [^1,1/2], [xi.yi])). 
Q2{idl^idl' ]g^g') : LandP(idb]g) A LandP{idb' ] g') A Touches(g^ g). 

The answer to Qi is {idb2]g^). The answers to Q2 are: {(id/i, idl2]gi^g2), {idh^ ^^^3; 5^2, ^3), {idli, idls; 
^1,^3), {idl2,idli-g2,gi), {idh.idW, g^, g2) , {idk, idh; gs, gi)}. □ 

Now we define the notion of consistent answer to a conjunctive query. 

Definition 9 Consider an instance a set of SICs, and a conjunctive query Q{x;s). A tuple of the- 
matic/geometric constants (ci, . . . , c^; ^1, . . . , is a consistent answer to Q with respect to if: (a) 
For every G Rep{D^ ^^), there exist g'^,...^g[ such that D' |= Q(ci, . . . , c^; g'l,.. • , 9i 
intersection over all regions g[ that satisfy (a) and are correlated to the same tuple in (7on(Q, D^"^) 
denotes the set of consistent answers to Q in instance D with respect to □ 

^Via the correlation function /, cf. Definition[3] 
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Figure 1 1 : Example of a range query 
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Figure 12: Consistent answers 



Since Q is operator free, the regions g[ appear in relations of the repairs, and then can be applied. 
However, due to the intersection of geometries, the geometries in a consistent answer may not belong to the 
original instance or to any of its repairs. 

In contrast to the definition of consistent answer to a relational query [|2l, where a consistent answer is 
an answer in every repair, here we have an aggregation of query answers via the geometric intersection and 
grouped-by thematic tuples. This definition is similar in spirit to consistent answers to aggregate relational 
queries with group-by [l3]l9l[T7l. 

This definition of consistent answer allows us to obtain more significative answers than in the relational 
case, because when shrinking geometries, we cannot expect to have, for a fixed tuple of thematic attribute 
values, the same geometry in every repair. If we did not use the intersection of geometries, we might lose or 
not have consistent answers due to the lack of geometries in common among repairs. 

Example 10 (example [6] cont.) Consider the spatial range query 

Q{idl; geometry) : 3name owner (LandP {idl ^ name ^ owner; geometry) A 

Intersects{geometry, ([xi,yi], [x2,yi], [^2,^2], [^1,2/2], [xi.yi])), 
which is expressed in the SQL language as: 

SELECT idl, geometry 
FROM LandP 

WHERE Intersects {geometry, ([xi,^i], [x2,yi], [x2,y2], [^1,2/2], [xi,yi])). 

Now, consider the two minimal repairs in Figure |6] In them, objects idk and idls do not change geometries, 
whereas object idl2 does, from ^2 to ^25.^2' ^^^P- (^f- Figure |6ta), (b), resp.). 

From the first repair we get the following (usual) answers to the query: {idli;gi), {idl2] ^2)5 (^^^35 ^3)- 
From the second repair, we obtain {idli;gi), {idl2] ^2)5 (^^^35 ^3)- The consistent answers are the tuples 
shown in Figure [121 where the answers obtained in the repairs are grouped by an idl in common, and the 
associated geometries are intersected. In this figure, the geometry with thicker lines corresponds to the 
intersection of geometries obtained from different repairs. 

From a practical point of view, the consistent query answer could include additional information about 
the degree in which geometries differ from their corresponding original geometries. For example, for the 
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Figure 13: The core of an instance 



answer {idh] ^2)' additional information could be the relative difference between areas g2 and which 



The definition of consistent query answer relies on the auxiliary notion of minimal repair. However, since we 
may have a large number of repairs, computing consistent answers by computing, materializing, and finally 
querying all the repairs must be avoided whenever there are more efficient mechanisms at hand. Along these 
lines, in this section we present a methodology for computing consistent query answers to a subclass of 
conjunctive queries with respect to certain kind of SICs. It works in polynomial time (in data complexity), 
and does not require the explicit computation of the database repairs. 

We start by defining the core, which is a single database instance associated with the class of repairs. 
We will use the core to consistently answer a subclass of conjunctive queries. Intuitively, the core is the 
"geometric intersection" of the repairs, which is obtained by intersecting the geometries in the different 
repair instances that correlate to the same thematic tuple. 

Definition 10 For an instance D and a set of SICs, the core of D is the instance given by i?* := 
{R{a]g~') \ R e n, there is i?(a; ^) G D md g"" = f]{g' \ R{a;g') E for some G Rep{D,'^) 
and i?(a; g') = fD'{R{a; g))}}. Here, fo' is the correlation function for □ 

Sometimes we will refer to by Rep{D^ ^^). However, it cannot be understood as the set-theoretic 
intersection of the repairs of D. Rather it is a form of geometric intersection of geometries belonging to 
different repairs and grouped by common thematic attributes. It is important to remark that the keys of 
relations remain in the repairs, and therefore they appear in the core of a dimension instance. 

Example 11 Figure [13] shows the core of the database instance in Figure [5] considering the repairs in Fig- 
ure [6l Here, ^2 results from the geometric intersection of geometries ^2 92 ^^e minimal repairs in 
Figure [6l Similarly, gl is ^5, because the latter is shared by both minimal repairs in Figure [6l Geometry g^ 
becomes ^0 in the core. All other geometries in the core are unchanged with respect to geometries in the 
original database instance. □ 

Notice the resemblance between the definitions of consistent answer and the core. Actually, it is easy to see 
that =^{j Con{QR^ ^^), where the query Qr{x; s) : R{x; s) asks for the tuples in relation R. 

The core is defined as the geometric intersection of all database repairs. However, as we will show, for 
a subset of SICs we can actually determine the core without computing these repairs. This is possible for 

^^Here, H ^ set- theoretic intersection of geometries. 



is calculated by S{g2^g2)/cirea{g2). 



□ 



5 Core-Based CQA 
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SICs of the form: 



\/xiX2SiS2^{R{xi;si) A R{x2;s2) A x[ X2 A T{st,S2)), (8) 

where T G {Ilntersects^ Intersects^ Equals}, x[ C xi, x'2 ^ ^2, and both x'^ and x'2 are the key of R. In 
these SICs there are two occurrences of the same database predicate in the same SIC. The following example 
illustrates this class of SICs. 

Example 12 For the schema — {County(idc^ name; geometry), Lake(idl; geometry)}, with zdc the 
key of County and idl the key of Lake, the following SICs are of the form (|8]): 

y^{County{idci,ni; si) A County{idc2,n2] S2) Aidci ^ idc2 A IIntersects{si, S2)). (9) 

y^{Lake{idli] si) A Lake{idl2] S2) A idli ^ idl2 A Inters ects{si, S2)). (10) 

□ 

Remark 4 This subset of SICs has the following properties, which will be useful when trying to compute 
the repairs and the core: 

(i) Two SICs of the form ([8]) over the same database predicate are redundant due to the semantic in- 
terrelation of the topological predicates Ilntersects, Intersects, and Equals: only the constraint that 
contains the weakest topological predicate has to be considered. For example, Intersects is weaker 
than Ilntersects, and Ilntersects is weaker than Equals. 

(ii) Conflicts between tuples with respect to SICs of the form ([8]) are determined by the intersection of 
their geometries. The conflict between two tuples R{ai;gi) and R{a2'i 92) is solved by applying a 
single admissible transformation operator tr^{gi,g2) (or tr^"" {g2^gi)) that modifies gi (or ^2), and 
makes T(^i, ^2) (and T^(^2, ^1)) false. 

(iii) Solving conflicts with respect to a SIC of the form ^ is independent from solving a conflict with 
respect to another SIC of form ^ over a different database predicate. 

(iv) Solving a conflict between two tuples with respect to a SIC of the form ([8]) does not introduce new 
conflicts. This is due to the definition of admissible transformations and the monotonicity property of 
predicates Ilntersects and Intersects, which prevent a shrunk geometry (or even an empty geometry) 
from participating in a new conflict with an existing geometry in the database (cf. Example [T3]). 

(v) For any two geometries gi and g2 in conflict with respect to a SIC of the form (|8]), there always 
exist two repairs, one with the shrunk version of gi, and another with the shrunk version of ^2- This 
guarantees that there exists a minimal repair that contains a minimum version of a geometry whose its 
geometric intersections with original geometries in conflict have been eliminated (cf. Lemma O. As 
a consequence, the core can be computed by taking from a geometry all its intersections with other 
geometries in conflict, disregarding the order in which these intersections are eliminated. 

This property is not guaranteed for other kinds of SICs. For instance, consider Example [6] with the 
instance in Figure [5] and its corresponding repairs in Figure [6l Although g^ was originally in conflict 
with respect to g2, there is no minimal repair where geometry g^ is shrunk. □ 

We illustrate some of these properties with the following example. 
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Figure 14: An inconsistent database with SICs of the form ^ 



Example 13 (example [12] cont.) Consider the inconsistent instance in Figure [T4l In it, counties with geome- 
tries gi, g2 and gs are inconsistent with respect to SIC (|9]), because they internally intersect. Also, county 
gs internally intersects with geometry g^. Lakes with geometries gQ and g^ violate SIC ([TOl) . because they 
intersect (actually they touch). 

Conflicts with respect to SICs ^ and (fTOl) can be solved in an independent way, since they do not share 
predicates (cf. Remark Uiii)). To obtain a repair, consider first SIC Q and the conflict between ^2 and 
gs, which is solved by applying tr^^(^2, ^3) or tr^^(^3, ^2)- Any of these alternative transformations do not 
produce geometries that could be in conflict with other geometries unless they were originally in conflict (cf. 
Remark Uiv)). For instance, if we apply tr^^(^3, ^2) we obtain a new geometry ^3 that will be in conflict 
with geometries gi and ^4. These conflicts are not new, since gs was originally in conflict with these two 
geometries. Even more, by shrinking ^2 or ^3, none of the modified geometries could be in conflict with 
^5. In addition, although by making ^3 = tr^\gsi92) we also solve the conflict between gi and ^3, this is 
only accomplished due to the fact the conflicting part of ^3 and gi has been already eliminated from ^3 (cf. 
Remark in v)). 

Figure [15] shows the sixteen possible minimal repairs that are obtained by considering the eight possible 
ways in which conflicts with respect to SIC (|9]) are solved, in combination with the two possible ways in 
which conflicts with respect to SIC ([TO]) are solved. In this figure thick boundaries represent geometries that 
have changed. Notice that in this figure we only show i^ot ^22' since the later corresponds to the 

empty geometry which is then omitted in the corresponding repairs. The core for this database instance is 
shown in Figure [16] □ 

It is possible to use a tree to represent all the versions that a geometry g may take in the repairs. The root of 
that tree is the original geometry g, the leaves are all the possible versions of the g in the minimal repairs. 
The internal nodes represent partial transformations applied to g as different conflicts in which g participates 
are solved. For illustration. Figure [T?] shows the tree that represents the possible different versions of ^3 in 
the minimal repairs for the inconsistent instance in Figure [14] Notice that a leaf in this tree represents a 
version of g^ in a repair, which is not necessarily a minimum geometry. For instance, in Figure [17] the 
minimum version of g^ is g'^_^ . For all other non-minimum versions of g^ in the leaves, conflicting areas are 
taken from other geometries. For example, geometry ^3^ results by keeping ^3 as originally and shrinking 
geometries gi, g2 and ^4. 

The following lemma establishes that when a geometry g is involved in conflicts of SICs of the form ([8]), 
there exists a version of g in the repairs that is minimum with respect to set-theoretic (geometric) inclusion. 
This result is useful to show that the minimum version of g is the one that will be in the core. 

We need to introduce the set ^i^,^(a, g) that contains, for a given tuple i?(a; g) in a database instance 
D, all the possible versions of geometry g in the minimal repairs of D. 

Definition 11 Let D a database instance, a set of SICs of the form ([8]) and a fixed tuple i?(a; g) e D. 
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Figure 15: The sixteen possible repairs of Example [13] 



County* 



idl name geometry 



idci ni 

idC2 712 

idcs ns 

idc4 77,4 

idc4 77,5 



9u 

90 

fl5i 




'I1 



93z 



i4, 



Figure 16: The core of Example [13] 
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Then, GR,^{a,g) = {g'\R{a',g') G D\D' G Rep{D,^)}, f-\R{a;g')) = R{a;g). □ 
Lemma 2 The set of geometries 0i^,^(a, g) has a minimum element g^nin under set-theoretic inclusion. 
Proof: By properties of SICs of the form (|8]), for each conflict in which R(a; g) ^ D participates, we can 
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or cannot shrink g. This leads to a combination of possible transformations over geometry g that can be 
represented in a binary tree as shown in Figure [TTI So, we have a non-empty set of geometries ^i^,^(a, g). 

In this tree, we always have a path from the root to a leaf in which the geometry is always shrunk; that is, 
all conflicting areas are eliminated from g. The leaf geometry in this path (repair) is the minimum geometry 

9min ' ' 



Corollary 1 Consider a database instance D, a set of SICs of the form ([8]), and a fixed tuple R{a; g) G D. 
For gmin, the minimum geometry in ^i?,^(a, g), it holds i?(a; gmin) ^ D^- 

Proof: Direct from Lemma |2] and the definition of the core as a geometric intersection. □ 



5.1 Properties of the Core 

In this section we establish that for the set of SICs of the form ([8]), and basic conjunctive queries, it is 
possible to compute consistent answers on the basis of the core of an inconsistent instance, avoiding the 
computation of queries in every minimal repair. This is established in Theorems [T] and |2l respectively. 

Theorem 1 For an instance D, a set of SICs of the form ([8]), and a basic spatial range query Q{u; s), it 
holds (a; g) G Con{Q, D, if and only if (a; g) G Q(i?*). 

Proof: The projection of range queries always includes the key of the relation in the result. Thus, if (a^g) G 
Con{Q, D, ^^), then for every G Rep{D, ^^), there exists R(b; g'), such that a C 6, f~^{R(h] g')) = 
R(h] g) and R(h] g) G D, where T{g' ^ w) is true for the spatial constant w of the range query and g = f]g' 
with the intersection ranging over all g\ 

By LemmaO there exists tuple R(h]gmin) G i^' G Rep{D^'^) with g^nin G Qn^^fih'.g)' If {a'.g) G 
(7on(Q, i?, ^^), with a C b, g ^ f^g' ^ gmin- Also, it must happen that {a; gmin) ^ Q(^0- Then by 
Corollary [I] {a; gmin) ^ Q(i?^), and therefore {a;g) G Q{D~^). 

In the other direction, if (a,^*) G Q(i^*) (with £)* = f]^ Rep^D,"^)), then there exists a tuple 
R{b;g'^) G i?*, with a C b and ^* ^ ^q. By the monotonicity of T G {Intersects^ Ilntersects}, if 
T(g''^w) is true, then for all geometries g' in i?(6; g') G Rep{D, ^^), with ^^^^ C ^ T(^', it;) is 
also true. Then, by Lemma|2]and Corollary [T] ^* = = ^^^^ and (a,^*) G (7on(Q, ^^). □ 

A similar result can be obtained for basic join queries, i.e., queries that consider two database predicates 
(not necessarily different). Notice that for a SIC Lp of the form (|8]) with a database predicate R and a basic 
join query of the form (|7]) with R — Ri — R2, the consistent answers do not contain information from 
tuples that were originally in conflict. This is because by solving conflicts with respect to Lp, all possible 
intersections between tuples in R will be eliminated (a basic join query asks for geometries that intersect). 

The following example illustrates how to compute consistent answers to basic join queries. This example 
will also illustrate the proof of Theorem |2] 

Example 14 (example [13] cont.) Consider the following basic join query posed to the instance D in Exam- 
ple [131 It is asking for the identifiers and geometries of counties and lakes that internally intersect. 

Q{idc,idl; gi, g2) : 3n{County{idc,n] gi) f\ Lake{idl] g2) A IIntersects{gi, g2)). 
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The consistent answer to this query is (zrfca, idl2; ^Qr-^)- Without using the core, this answer is obtained 
by intersecting all answers obtained from every possible minimal repair. The geometries in the repairs of 
D with respect to (SICs ([9]) and ([TOl) ) can be partitioned into the following sets: Q County, ^{idci^^ii 9i) = 

{9U, 9lJ County, ^{idC2, 712; 92) = {90, 92^ 922}^ ^ County,^{idC3, Us; gs) = {^3,, ^32, ^33' ^34' ^35' ^35}' 
G County,^ {idC4, 714; 94) = {^41^^42}' G County M^dc^, n^] 9^) = {^sj' G Lake,^{^dlQ] Qq) = {^61^^62}' 

GLake,^{idl^] ^7) = {9'^^i9'^^}' The minimum geometries in these seven sets are ^0 (corresponding to 
the update of geometry ^0, g'^^, g'^^, g'^^, g'^^, and ^i.^ , respectively. 

Also, for the database predicates County and Lake, there are two sets containing the possible extensions 
of them in the repairs: {County{D')\D' G Rep{D^ ^^)}, containing the eight versions of counties (first 
eight versions of counties in Figure [TST): and {Lake{D')\D' G Rep{D^ ^^)}, with the two instances of lakes 
(one with geometries g^^ and gij^ , and the other with geometries g^^ and gij^ in Figure [15]). Note that the pos- 
sible minimal repairs contain combinations of geometries in sets Gcounty,^ {idc, n; g) and GLake,^{idl; g). 
In particular, there exists a repair that combines the minimum geometries ^3^ and g^^, and another repair 
that combines ^3^ and gi^_^. 

If the topological predicate in the basic join query is satisfied by the combination of two minimum 
geometries, then other versions of these geometries in other repairs (which geometrically include the min- 
imum geometries) will also satisfy it. In this example, ^3^ and gij^ intersect and, by the monotonicity 
property of predicate Ilntersects, all other versions of gs and g^ in other repairs also intersect. As result, 
{idcs^idl2; ^3^ , ^7 J is an answer to the query. Finally, by Corollary [T] ^3^ and gij^ are in the core of the 
database instance and, therefore, {idcs^ idl2] ^3^ , ^7^ ) is also an answer to the query over the core. □ 

Theorem 2 For an instance D, a set of SICs of the form ([8]), and a basic spatial join query Q(xi, X2; 5i, 52), 
it holds (ai,a2,^i,^2) ^ Con{Q,D,^) if and only if (ai, 02, ^1, ^2) ^ Q(^*). 

Proof: The projection of join queries also includes keys. Thus, if (ai, a2; ^1, ^2) ^ Con(Q, then 
we have tuples g'l) G D' , R2(h2] 92) ^ ^^ every D' G Rep{D^ with ai C 61, d2 C 62, and 

T(^^, ^2) tnxe for T in Q. Thus, gi is the intersection of all those g[, and g2 is the intersection of all those 

First, note that if i?i = i?2, only tuples that were not originally in conflict may be in the answer. These 
tuples will be trivially in the core, because no geometric transformations over their geometries are applied. 
Thus, their geometries will be in the answer, if and only if, they satisfy the topological predicate in the query. 

By the property (iii) of SICs of the form ^ (cf. Remark |4]), solving conflicts on two different database 
predicates i?i and R2 are independent. Let us assume that {Ri {D') \ D' Rep{D, } and {R2 {D') \ D' Rep{D^ 
^^)} are the different extensions of predicates Ri and R2 in all possible minimal repairs. Then, Rep{D^ 
contains database instances that result from the combination of these two sets. Consequently, and using 
Lemma[2l for two given hi and 62, there exists a repair D' G Rep{D^ ^) such that i?i(6i; ^^^.^) G D' and 
R2(b2;92rmJ ^ ^'^^^^ ^L^n minimum in ^7^1,^(61, ^1) and ^^^^^ is minimum in Gr2,^{^2, ^2). 

We now prove that if (ai, a2]gi^g2) ^ Con{Q^ D, ^^), then (ai, 02; ^1,^2) ^ Q(i)*). By definition of 
consistent answer, if (ai,a2; ^1,^2) ^ ^^^(2? ^5 then (ai, 02, ^i^.^, ^2^.^) ^ Q(i?0- Corollary[Tl 
(ai,a2;^/i,^2) ^ 2(1?*), with ^1 ^ g'l^^^ and ^2 = ^2^,^. 

In the other direction, if (ai, a2, ^1, ^2) ^ Q(i?*), then {ai,a2,gl,gl) G Con{Q,D,^). By Corol- 
lary [D gl = and gl = and i?i(6i;^l_) G I)* and i?2(62;^L^J G i?*. Then, by mono- 
tonicity property of predicate T G {Intersects^ Ilntersects} in Q, if T{g[^g2) is true, it is also true for 
all Ri{h;g'{) G D^' and in i?2(52;^^0 ^ with i^^^ G Rep{D,^) and C and C 
Therefore, (ai, a2, ^t, ^2) ^ Con(Q, i:), ^^). □ 
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The previous theorems tell us that we can obtain consistent answer to basic conjunctive queries by direct 
and usual query evaluation on the single instance D^, the core of D. This does not hold for non-basic 
conjunctive queries as the following example shows. 

Example 15 Consider a database instance with a database predicate R whose geometric attribute values 
are shown in Figure [TSla). This database instance is inconsistent with respect to a SIC that specifies that 
geometries cannot overlap. Let us now consider a range query of the form 3y{R(x; g) A Touches{g^ s)), 
where 5 is a user defined spatial window, and y ^ x. Figure [TSlb) shows the query over the intersection of 
all repairs (the core), obtaining geometries gl and g2, from where only gl touches s. Figures [TSlc) and (d) 
show the query over each repair, separately. The answer from the repair in (c) is g[, and repair (d) does not 
return an answer because none of the geometries in this repair touches s. Their intersection, therefore, is 
empty and differs from the answer obtained from the core. This difference is due to the fact that the query 
window s touches geometry g[ in only one of the repairs. □ 



5.2 Computing the Core 

We now give a characterization of the core of a database instance for a set of SICs of the form (|8]), which is 
not explicitly based on the computation of minimal repairs. This equivalent and alternative characterization 
of the core allows us to compute the core without having to compute all the minimal repairs. 

To simplify the notation, we introduce a logical formula that captures a conflict around a tuple of relation 
R e D and a SIC of the form ([8]) with topological predicate T: 

yXiX2SiS2{Conflj-, j^r^{xi,Si,X2,S2) = A R{X2]S2) AXi^X2 A T(5i, 52))). (11) 

Definition 12 Let be a database instance and a set of SICs of the form ([8]). For the core of D with 

respect to ^, it holds i?^ = Dfj^tersects n I^Mersects ^ ^EquaV ^herc: 

(a) DJ^ntersects = {^(^; Difference{g , t)) \ R{a; g)eD,t = [j{g' \ for every i?(6, g') G D such that D H 
Conflj-, ji jj^^^^g^^^g{a^ g^ 6, g')}}, where |J is the geomUnion operator that calculates the geometric 
union (spatial aggregation) of geometries. 
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(b) Dj^tersects = {R{a; Difference {g ,t)) \ R{a-g) e D,t = [j{Bujfer{g^ ,d) \ for every R{b,g^) G 
such that 1= Conflj^ ji j^t^^^^^t^(a,g,b,g')}}. 

(^) ^Equai = I R(o.,g) G L), it does not exists G h Conflj^j^EquaM^d^^^g))}- 

□ 

Notice that t is the union of all the geometries that are in conflict with a given geometry g. It is obtained by 
using the spatial aggregation operator geomUnion. 

Now, we give the specification of the cores: I^^^^er.ect.' DJj^tersects^ and D^^^^^^ as views in a spatial 
SQL language0 In the following specification, we assume a database instance D with a relational predicate 
R{id] geometry) and primary key id. The following specification shows that our methodologies could 
be implemented on top of current spatial database management systems. In particular, the definition of 
^Intersects ^^^^ ^ fixcd valuc d that represents the minimum distance between geometries in the cartographic 
scale of the database instance. The intersection of these views makes D^. 

Table |4] shows three views that enables to compute the core of the database with a database predicate 
R(idl] geometry). 

Example 16 (example [TOl cont.) The example considers only the relation LandP with primary key idl and 
the SIC ([3]) of Example[2l We want to consistently answer the query of Example fTTl i.e., 3name owner (LandP 
{idl^ name^ owner] geometry) A I Inter sects{geometry ^ {[xi^yi]^ [x2^yi]^ [^2, 1/2], [^i, ^2], [xi^yi])). 

To answer this query, we generate a view of the core applying the definition in Table IH That is, we 
eliminate from each geometry the union of conflicting regions with respect to each land parcel. In this case, 
the conflicting geometries for ^2 are g^ and ^4; for geometry g^ is g2\ and for geometry ^4 is ^2- This is the 
definition of the core in SQL: 



CREATE VIEW 


Core 


AS (SELECT 


li.idl AS idl, ll.name AS name, li. owner AS owner. 




difference (I I. geometry, geomunion(l2 .geometry)) AS geometry 


FROM 


LandP ASh, LandP AS h 


WHERE 


li.idl <> l2.idl AND Intersects (h .geometry , I2. geometry) AND 




NOT Touches (I I. geometry, I2. geometry) 


GROUP BY 


li.idl, ll.name, li. owner, 11. geometry 


UNION 




SELECT 


li.idl AS idl, ll.name AS name, li. owner AS owner, li. geometry AS geometry 


FROM 


LandP AS h 


WHERE 


NOT EXISTS(SELECT h.idl, h.geometry 




FROM LandP AS h 




WHERE li.idl <> l2-idl AND Intersects (li .geometry , I2 -geometry) AND 




NOT Touches (I I. geometry, I2. geometry))) 



We now can pose the query to the core to compute the consistent answer to the original query: 

current SQL Language IIntersects{gi, §2) — I nter sects{gi, g2) AND NOT Touches{gi, g2) = Overlaps{gi, g2) 
OR Within{gi, g2) OR Contains{gi, g2) OR T ouches {g i, g2). 

^^Optimizations to the SQL statements are possible by using materialized views and avoiding double computation of join opera- 
tions. 
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CRbAlh VlhW 


Core-Intersects 






ri Ad AS id, difference{r^ .geometry, 
Buffer(geomunion(r2 .geometry), d)) AS geometry 




FROM 


RAS ri,RAS r2 




WHERE 


ri .id <> r2.id AND 

Intersects{r 1 .geometry , r 2 -geometry) 


t t 


GROUP RY 




T TMT/^M 
UJNiUJN 






SELECT 


ri .id AS id, ri .geometry AS geometry 




fivvJiVl 






WrlxiKii 


iNUi JiAio 1 (oiiLiiL. 1 r 2 -Id 1 r 2 -geometry 
FROM R AS r2 

WHERE n .id <> r2-id AND 
Intersects (^r± .geometry , r2 -geometry))) 




r^TiJJ \'T^JJ \7JTJW7 


Core -Ilntersects 




AO/ OTIT Tl/^l'T' 

AS (SbLhCl 


■ J AO 'J J'£C / J. 

ri .za AS za, aijjerence{r 1 .geometry, 
geomunioni^r 2 -geometry)) AS geometry 




FROM 


OAO„ OAO„ 

it AS ri, it AS r2 




WHERE 


n <> r^.zd AND 

Intersects{r 1 .geometry, r2 -geometry) AND 
NOT Touches(ri .geometry, r2- geometry) 


JJ-k 

Ilntersects 


GROUP BY 


ri .id, ri .geometry 








OCT C'/^T 


AO 

r_z .id AS za, r_z .geometry AS geometry 






it /\0 7 1 




WTU'C'D'C 

WrlilRil 


JN(Jl LXiS 1 S (SLLLC 1 r2 -id, r2 -geometry 

Fivw iVl ih /\v3 / 2 

WHERE n .id <> r2-id AND 
Intersects{r 1 .geometry, r 2 -geometry) AND 




CREATE VIEW 


Core-Equal 




AS (SELECT 


ri .id AS id, ri .geometry AS geometry 




FROM 


RAS n 


^Equal 


WHERE 


NOT EXISTS (SELECT r2-id, r2 -geometry 




FROM AS r2 

WHERE ri .z(i <> r2-id AND 
Equals{ri .geometry , r 2 -geometry))) 



Table 4: SQL statements to compute views for DJ^^^^^^^^^, DJ^ntersects^ ^nd 



SELECT idZ, name, owner, geometry (12) 
FROM Core 

WHERE Intersects {geometry, {[xi,yi], [x2,yi], [x2,y2], [^^1,1/2], [^1,1/1])) 
The answer is shown in Figure [12] This query is a classic selection from the Core view. □ 

This core-based method allows us to compute consistent answers in polynomial (quadratic) time (in data 
complexity) in cases where there can be exponentially many repairs. In Example [H where we have 2^~^ 
minimal repairs, we can apply the query Q over the core, and we only have to compute the difference of 
a geometry with respect to the union of all other geometries in conflict. This corresponds to a polynomial 
time algorithm of order polynomial with respect to the size of the database instance. 
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6 Experimental Evaluation 



In this section we analyze the results of the experimental evaluation we have done of the core-based CQA 
using synthetic and real data sets. The experiment includes a scalability analysis that compares the cost of 
CQA with increasing numbers of conflicting tuples and increasing sizes of database instances. We com- 
pare these results with respect to the direct evaluation of basic conjunctive queries over the inconsistent 
database (i.e., ignoring inconsistencies). The latter reflects the additional cost of computing consistent an- 
swers against computing queries that ignore inconsistencies. 

6.1 Experimental Setup 

We create synthetic databases to control the size of the database instance and the number of conflicting 
tuples. We use a database schema consisting of a single predicate R{id; geometry), where id is the numeric 
key and geometry is a spatial attribute of type polygon. We create three sets of synthetic database instances 
by considering SICs of the form ^ with different topological predicates: 



Set 










SIC 


Equals 


V 






A R{X2]S2) 


A xi 7^ X2 A Equals {s 1 , S2)) 


Intersects 


V 






A R{X2]S2) 


A xi X2 A Intersects (s i , S2)) 


Ilntersects 


V 






A R{X2]S2) 


A x\^X2 A Ilntersects {s 1 ^ S2)) 



For each set we create five consistent instances including 5,000, 10,000, 20,000, 30,000, and 40,000 tuples 
of homogeneously distributed spatial objects whose geometries are rectangles (i.e., 5 points per geometric 
representation of rectangles). Then, we create inconsistent instances with respect to the corresponding SICs 
in each set with 5%, 10%, 20%, 30%, and 40% of tuples in conflict. For database instances with a SIC and 
topological Equals, we create inconsistencies by duplicating geometries in a percentage of geometries. For 
database instances with a SIC and topological Ilntersects, we create inconsistencies by making geometries 
overlap. Finally, for database instances with a SIC and topological Intersects, we create inconsistencies by 
making a percentage of geometries to touch. 

Due to the spatial distribution of rectangles in the sets, the cores for database instances with SICs using 
topological predicates in {Intersects^ Ilntersects} have the same numbers of points in their geometric 
representations than their original instances. For the set of database instances with SICs using topological 
predicate Equals, the numbers of points in the geometric representations of their cores are less than in the 
original databases, because we eliminate geometries as we restore consistency. Thus, we are not introducing 
additional storage costs in our experiments. 

To have a better understanding of the computational cost of CQA, we also evaluate the cost of CQA over 
real and free available data of administrative boundaries of Chile [I j. Chilean administrative boundaries have 
complex shapes with many islands, specially, in the South of Chile (e.g., a region can have 891 islands). For 
the real database, we have two predicates Counties and Provinces. Notice that, at the conceptual label. 
Provinces are aggregations of Counties. In this experiment, however, we have used the source data as it 
is, creating separated tables for Counties and Provinces with independent spatial attributes. For this real 
database, we consider SIC of the form: V 51 ) A R{x2]S2) A x\ ^ X2 A Ilntersects {s ^ ^ S2)), 

with R being Counties or Provinces. 

Table [5] summaries the data sets for the experimental evaluation. The percentage of inconsistency is cal- 
culated as the number of tuple in any conflict over the total number of tuples. The geometric representation 
size is calculated as the number of points in the boundaries of a region. 
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Source 


Name 


Tuples 


Inconsistency (%) 


Geometric representation size 


Synthetic 


Equals 


5,000-40,000 


5-40 


25,000-200,000 




Ilntersects 


5,000-40,000 


5-40 


25,000-200,000 




Intersects 


5,000-40,000 


5-40 


25,000-200,000 


Real 


Provinces 


52 


59 


35,436 




Counties 


307 


12.7 


72,009 



Table 5: Data sets of the experimental evaluation 



We measure the computational cost in terms of seconds needed to compute the SQL statement on a Quad 
Core Xeon X3220 of 2.4 GHz, 1066 MHz, and 4 GB in RAM. We use as spatial DBMS PostgreSQL 8.3.5 
withPostGIS 1.3.5. 

6.2 Experimental Results 

Figure [19] shows the cost of the core computation for the different synthetic database instances. To make 
this experimental evaluation easier and faster, we used materialized views so that we computed only once 
the core and applied queries on this core's view. However, we added the computational cost of the core to 
each individual query result to have a better understanding of the cost of applying CQA. 

The time cost of computing the core for inconsistent databases with respect to a SIC with a topological 
predicate Equals decreases as the number of tuples in conflicts increases, since the core eliminates geome- 
tries in conflict and, therefore, these empty geometries are then ignored in geometric computations. The cost 
of computing the core is largely due to the spatial join given by the topological predicate of a SIC, which 
could decrease using more efficient algorithms and spatial indexing structures. 



Time cost of core computation 



- <T>---gD--- ®--^a)Sooo ^ 



5 10 20 30 40 5 10 20 30 40 

% inconsistency 



5 10 20 30 40 



Figure 19: Time cost of the core computation for different SICs, different levels of inconsistency, and 
different sizes of databases instances 



For the synthetic database instance. Figures |20] and |2T] show the cost rate between computing a CQA 
with respect to simple range or join queries (with the spatial predicate Intersects) that ignore inconsistencies. 
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Range queries use a random query window created by a rectangle whose side is equivalent to 1% of the total 
length in each dimension. Notice that the time cost of computing a range query for a database instance 
with 10,000 was approximately 15 ms, which, in average, was 900 times less than computing a join query. 
These reference values exhibit linear and quadratic growth for range and join queries, respectively, as we 
consider increasing sizes of database instances. The computational cost of CQA to join queries include the 
computation of the core; however, this cost could be amortized if we use a materialized view of the core 
for computing more than one join query. In the time cost of CQA for range queries, we have optimized 
the computation by applying the core-computation over a subset of tuples previously selected by the query 
range. This optimization is not possible for join queries, since no spatial window can constrain the possible 
geometries in the answer. 



Relative CQA cost to range queries 
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Figure 20: Relative cost of CQA to range queries 



The results indicate that CQA to a range query can cost 100 times the cost of a simple query. This is 
primarily due to the join computation of the core. Indeed, when comparing the CQA to a join query, we 
only duplicate the relative cost, and in the best case, keep the same cost. However, join queries have a 
significant larger computational cost. Notice that the computation cost for a CQA to range query is around 
60s in the worst case (40,000 tuples). With exception of cases when the core contains empty geometries, the 
percentage of inconsistencies does not affect drastically the results. 

We also evaluate the scalability of the CQA cost to range queries in function of the size of the query 
window (i.e., spatial window). In Figure [22]we show the relative CQA cost to range queries on a synthetic 
database instance with 10,000 tuples and range queries whose random spatial windows varied from 1% to 
5% of the size in each dimension. The results indicate that the relative cost increases logarithmical as we 
increase the size of the query window. Also, only for database instances with a SIC and topological predicate 
Equals, the relative cost suffers some variation across different percentages of inconsistencies, primarily, 
due to the elimination of geometries in the database. 

Finally, we applied the core-based computation of CQA to the real database instances in Table |5] Table |6] 
summaries the results obtained with these data, which were in agreement with the results obtained with the 
synthetic database instances. In this table, APoints represents the relative difference in the size of the geo- 
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Figure 21: Relative cost of CQA to join queries 



Escalability of relative CQA cost to range queries 
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Figure 22: Relative cost of CQA to range queries and different sizes of the query window (using a database 
instance with 10,000 tuples ) 



metric representation between the core and the original database. Notice that computing the core increased 
the geometric representation of Provinces up to 5.0%, which is bounded by the shape of geometries in 
conflict (i.e., the size of the original geometric representation). In the case of Counties, however, the size 
of the geometric representation of the core decreases down to —0.03%. Since the geometry of provinces 
should be the geometric aggregation of counties, we could expect to have a relationship between A Points 
for Provinces and Counties. However, the source data set uses independent geometries for Provinces and 
Counties and no comparison can be made. 
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Ranj 




Join 


Data 


A Points 


Core 


Simple 


CQA 


Simple CQA 


Provinces 


+5.0% 


17.7 


0.04 


0.25 


29.8 63.4 


Counties 


-0.03%0 


18.1 


0.1 


2.1 


40.6 55.7 



Table 6: CQA cost with real data (costs of core and queries in seconds) 



7 Conclusions 

We have formalized a repair semantics and consistency query answers for spatial databases with respect 
to SICs. The repair semantics is used as an auxiliary concept for handling inconsistency tolerance and 
computing consistent answers to spatial queries. It is based on updates that shrink geometries of objects, 
even at the point of deleting geometries for some exceptional cases, as for predicate Disjoint. Geometries 
are virtually updated applying admissible geometric operators, which are available in most spatial DBMSs. 

By restricting ourselves to the application of the admissible transformations, we have a finite number of 
possibilities for making a pair of geometries consistent with respect to a SIC. However, there may still be 
exponentially many repairs for a given instance and set of SICs. With the purpose of avoiding to compute 
and query all repairs, we have identified cases of SICs and conjunctive (range and join) queries where the 
consistent answers can be obtained by posing a standard query to a single view of the original instance. This 
view is equivalent to the intersection of all possible minimal repairs, what we called the core of a database 
instance, which for a subset of SICs can be computed in polynomial time without determining each repair. 

An experimental evaluation of the core-based computation of CQA reveals that answering range queries 
has a cost that varies drastically in function of the topological predicates in SICs and the number of tuples 
in the database instance, reaching up to 100 times the cost of a simple range query. This is mainly due to 
the spatial join involved in computing the core. For join queries, instead, the cost of CQA is the double 
of a simple join query. These results do not use optimizations with spatial indexing, which has been left 
for future work. Even more, they assume that we have to compute the core for each query, which could be 
optimized by using materialized views. 

This work leaves many problems open. Most prominently, computability and complexity issues have 
to be explored. For example, some interesting decision problems are deciding if non trivial repairs (i.e., 
not obtained by cancellation of geometries) exist for an instance and a set of SICs, or deciding whether or 
not a particular instance is a repair of an inconsistent database instance. The complexity of deciding if a 
spatio-relational tuple is a consistent answer is also open. As in the relational case, we expect to find hard 
cases for all these problems. For them, it would be interesting to obtain lower complexity approximation 
algorithms. 

We have considered only regions to represent spatial objects. A natural extension of this work would 
be to define a repair semantics for other spatial abstractions, such as polylines, points, networks, and so 
on. We would also like to explore not only denial SICs, but also other classes of semantic ICs, and other 
types of repair semantics that include solving conflicts with respect to a topological predicate Disjoint. 
This includes also the possibility of considering combinations of spatial with relational constraints, e.g. 
functional dependencies and referential ICs. 

Acknowledgments 

This project is partially funded by FONDECYT, Chile, grant number 1080138. Part of this research was 
done when L. Bertossi was invited to the Universidad de Concepcion and Universidad del Bio-Bio. Monica 



34 



Caniupan received funding from FONDECYT, Chile, grant number 11070186. Leopoldo Bertossi has also 
been partially funded by an NSERC Discovery Grant (#315682). 



References 

Albers, C. Gisdata chile, administrative boundaries since 2007. 

http:// www.rulamahue.cl/mapoteca/catalogos/chile.html[ 2009. 

Arenas, M., Bertossi, L., and Chomicki, J. Consistent query answers in inconsistent databases. 
In 18th ACM Symposium on Principles of Database Systems PODS'99 (1999), ACM Press, pp. 68-79. 

Arenas, M., Bertossi, L., and Chomicki, J. Scalar aggregation in fd-inconsistent databases. In 
Proc. International Conference on Database Theory (2001), Springer LNCS 1973, pp. 39-53. 

Bertossi, L. Consistent query answering in databases. ACM Sigmod Record 35, 2 (2006), 68-76. 

Bertossi, L., and Bravo, L. Consistent query answers in virtual data integration systems. In 
Inconsistency Tolerance (2005), vol. 3300 of Lecture Notes in Computer Science, Springer, pp. 42-83. 

Bertossi, L., and Chomicki, J. Logics for Emerging Applications of Databases. Springer, 2003, 
ch. Query Answering in Inconsistent Databases, pp. 43-83. 

Bertossi, L., Hunter, A., and Schaub, T., Eds. Inconsistency Tolerance (2005), vol. 3300 of 
Lecture Notes in Computer Science, Springer. 

BORGES, K., Laender, A., AND DAVIS, C. Spatial integrity constraints in object oriented geo- 
graphic data modeling. In ACM International Symposium on Advances in GIS (1999), ACM Press, 
pp. 1-6. 

Caniupan, M. Optimizing and Implementing Repair Programs for Consistent Query Answering in 
Databases. PhD thesis, Carleton University, Department of Computer Science, 2007. 

Chomicki, J. Consistent query answering: Five easy pieces. In International Conference on 
Database Theory ICDTV? (2007), vol. LNCS 4353, Springer- Veriag, pp. 1-17. 

COCKCROFT, S. A taxonomy of spatial integrity constraints. Geolnfomatica 1, 4 (1997), 327-343. 

DUCKHAM, M., LiNGHAM, J., MASON, K. T., AND WORBOYS, M. F. Qualitative reasoning about 
consistency in geographic information. Inf. Sci. 176, 6 (2006), 601-627. 

Egenhofer, M., Clementine, E., and Felice, P. D. Evaluating inconsistency among multiple 
representations. In Spatial Data Handling (1995), pp. 901-920. 

Egenhofer, M., and Sharma, J. Assessing the consistency of complete and incomplete topolog- 
ical information. Geographical Systems 1 (1993), 47-68. 

Egenhofer, M. J., and Franzosa, R. D. Point set topological relations. International Journal of 
Geographical Information Systems 5 (1991), 161-174. 



35 



[16] Franconi, E., Palma, a. L., Leone, N., Perri, S., and Scarcello, F. Census data repair: 
a challenging application of disjunctive logic programming. In LPAR (2001), R. Nieuwenhuis and 
A. Voronkov, Eds., vol. 2250 of Lecture Notes in Computer Science, Springer, pp. 561-578. 

[17] FuxMAN, A., Fazli, E., and Miller, R. J. Conquer: Efficient management of inconsistent 
databases. In SIGMOD Conference (2005), R Ozcan, Ed., ACM, pp. 155-166. 

[18] GUTING, R. H. Graphdb: Modeling and querying graphs in databases. In VLDB (1994), J. B. Bocca, 
M. Jarke, and C. Zaniolo, Eds., Morgan Kaufmann, pp. 297-308. 

[19] GuTiNG, R. H., AND Schneider, M. Realm-based spatial data types: The rose algebra. VLDB J. 4, 
2 (1995), 243-286. 

[20] Hadzilacos, T., and Tryfona, N. a model for expressing topological integrity constraints in geo- 
graphic databases. In Spatio-Temporal Reasoning (1992), A. U. Frank, I. Campari, and U. Formentini, 
Eds., vol. 639 of Lecture Notes in Computer Science, Springer, pp. 252-268. 

[21] KuiJPERS, B., Paredaens, J., AND DEN BUSSCHE, J. V. On topological elementary equivalence 
of spatial databases. In ICDT (1997), F. N. Afrati and R G. Kolaitis, Eds., vol. 1186 of Lecture Notes 
in Computer Science, Springer, pp. 432-446. 

[22] Mas, S. Reasoning on spatial semantic integrity constraints. In COSIT (2007), S. Winter, M. Duck- 
ham, L. Kulik, and B. Kuipers, Eds., vol. 4736 of Lecture Notes in Computer Science, Springer, 
pp. 285-302. 

[23] OpenGis. Opengis simple features specification for sql. Tech. rep.. Open GIS Consortium, 1999. 

[24] Paredaens, J., den Bussche, J. V., and Gucht, D. V. Towards a theory of spatial database 
queries. In PODS (1994), ACM Press, pp. 279-288. 

[25] Paredaens, J., and Kuijpers, B. Data models and query languages for spatial databases. Data 
Knowl. Eng. 25, 1-2 (1998), 29-53. 

[26] Randell, D. a., Cui, Z., and Cohn, A. G. A spatial logic based on regions and connection. In 
KR {1992), pp. 165-176. 

[27] RODRfcUEZ, M. A. Inconsistency issues in spatial databases. In Inconsistency Tolerance (2005), 
vol. 3300 of Lecture Notes in Computer Science, Springer, pp. 237-269. 

[28] RODRfGUEZ, M. A., Bertossi, L., and Caniupan, M. An inconsistency tolerant approach to 
querying spatial databases. In ACM-GIS (2008), W. G. Aref, M. Mokbel, and M. Schneider, Eds., 
ACM. 

[29] Service, S., Puricelli, T., and Laurini, R. A methodology for spatial consistency improvement 
of geographic databases. Geolnformatica 4 (2000), 7-24. 

[30] Thompson, D., and Laurini, R. Fundamentals of Spatial Information Systems. No. 37 in APIC. 
Academic Press, 1992. 

[31] Tryfona, N., and Egenhofer, M. Consistency among parts and aggregates: A computational 
model. Transactions on GIS 1,1 (1997), 189-206. 



36 



[32] WlJSEN, J. Database repairing using updates. ACM Trans. Database Syst. 30, 3 (2005), 722-768. 



37 



9l 



92 



9l 



92 



